Numerical performance of penalized comparison to overfitting for multivariate kernel density estimation

نویسندگان

چکیده

Kernel density estimation is a well known method involving smoothing parameter (the bandwidth) that needs to be tuned by the user. Although this has been widely used, bandwidth selection remains challenging issue in terms of balancing algorithmic performance and statistical relevance. The purpose paper study recently developed method, called Penalized Comparison Overfitting (PCO). We first provide new theoretical guarantees proving PCO performed with non-diagonal matrices optimal oracle minimax approaches. then compared other usual methods (at least those which are implemented R-package) for univariate also multivariate kernel on basis intensive simulation studies. In particular, cross-validation plug-in criteria numerically investigated PCO. take home message can outperform classical without additional cost.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Feature significance for multivariate kernel density estimation

Multivariate kernel density estimation provides information about structure in data. Feature significance is a technique for deciding whether features – such as local extrema – are statistically significant. This paper proposes a framework for feature significance in d-dimensional data which combines kernel density derivative estimators and hypothesis tests for modal regions. For the gradient a...

متن کامل

Bandwidth Selection for Multivariate Kernel Density Estimation Using MCMC

Kernel density estimation for multivariate data is an important technique that has a wide range of applications in econometrics and finance. However, it has received significantly less attention than its univariate counterpart. The lower level of interest in multivariate kernel density estimation is mainly due to the increased difficulty in deriving an optimal datadriven bandwidth as the dimens...

متن کامل

Penalized semiparametric density estimation

In this article we propose a penalized likelihood approach for the semiparametric density model with parametric and nonparametric components. An efficient iterative procedure is proposed for estimation. Approximate generalized maximum likelihood criterion from Bayesian point of view is derived for selecting the smoothing parameter. The finite sample performance of the proposed estimation approa...

متن کامل

Approximate inference of the bandwidth in multivariate kernel density estimation

Kernel density estimation is a popular and widely used non-parametric method for data-driven density estimation. Its appeal lies in its simplicity and ease of implementation, as well as its strong asymptotic results regarding its convergence to the true data distribution. However, a major difficulty is the setting of the bandwidth, particularly in high dimensions and with limited amount of data...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Esaim: Probability and Statistics

سال: 2023

ISSN: ['1292-8100', '1262-3318']

DOI: https://doi.org/10.1051/ps/2022018